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FIELD OF THE INVENTION 

The present invention relates to a video encoding method provided for encoding an 
input image sequence consisting of successive groups of fiames in which each frame is itself 
subdivided into blocks, said method comprising the steps of : 

- preprocessing said sequence on the basis of a so-called content-change strength 
(CCS) computed for each frame by appljdng some predetermined rules ; 

- estimating a motion vector for each block of the firames ; 

- generating a predicted fiame using said motion vectors respectively associated to the 
blocks of the current fiame ; 

- applying to a difference signal between the current fi:ame and the last predicted 
frame a transformation sub-step producing a plurality of coefficients and followed by a 
quantization sub-step of said coefficients ; 

- coding said quantized coefficients. 

Said invention, applicable to video encoding devices that require reference frames for 
reducing e.g. temporal redundancy, like motion estimation and compensation devices, may 
be used for instance in devices like digital video cameras, mobile phones or digital video 
recording devices. Furthermore, applications for coding or transcoding video can be 
enhanced using the technique according to the invention. 

BACKGROUIVD OF THE INVENTION 

In video compression, low bit rates for the transmission of a coded video sequence 
may be obtained by (among others) a reduction of the temporal redimdancy between 
successive pictures. Such a reduction is based on motion estimation (ME) and motion 
compensation (MC) techniques. Performing ME and MC for the current fi^e of the video 
sequence however requires reference frames. Taking MPEG-2 as an example, different 
frames types, namely P- and B-fiBmes, have been defined, for which ME and MC is 
performed differently : I-fiames (or intra frames) are independently coded, without any 
reference to past or future frames (no ME and MC is performed), while P-fiames (or forward 
predicted pictures) are encoded relatively to past frames and B- frames (or bidirectional 
predicted frames) are encoded relatively to two reference frames (a past fiame and a future 
frame). The I- and P*fi:ames serve as reference fiames. 
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In order to obtain good ftame predictions, these reference ftames need to be of high 
quality, i.e. many bits have to be spent to code them, whereas non-reference ftames can be of 
lower quaUty (for this reason, a higher number of non-reference frames, B- ftames in the case 
of MPEG-2, generally lead to lower bit rates). In order to indicate which input frame is 
processed as an I- frame, a P-frame or a B- frame, a structure based on groups of pictures 
(GOPs) is defmed in MPEG-2. More precisely, a GOP uses two parameters N and M, where 
N is the ten^oral distance between two I-frames and M is the temporal distance between 
reference frames. For example, an (N,M)-GOP with N=12 and M=4 is conmionly used, 
defining an"IBBBPBBBPBBB" structure. 

Succeeding ftames generally have a higher ten^oral correlation than frames having a 
larger temporal distance between tiiem. Therefore shorter temporal distances between the 
reference and the currently predicted firame on the one hand lead to higher prediction quality, 
but on the ofhsr hand imply that less non-reference frames can be used. Both the higher 
prediction frames generally-resnltiir lower bi^ - 

rates, but they work against each other since the fi:ame prediction quality results fix)m shorter 

teii$>oral distances only. 

However, said quality also d^ends on the usefulness of the reference frames to 
actualbr sctvc as references. For exaniple, it is obvious that widi a reference frame located 
just before a scene change, flie prediction of a frame located just after the scene change is not 
possible with respect to said refisrence ftame, although Ihey may have a fiame distance of 
only 1. One the other hand, in scenes with a steady or ahnost steady content (like video 
conferencing or news), even a ftame distance of more than 100 can stiU result in high quality 
prediction. 

From the above-mentioned examples, it appears that a fixed GOP structure as the 
conmionly used ( 12, 4 )-GOP may be inefficient for coding a video sequence, because 
reference ftames are introduced too frequently, in case of a steady content, or at a unsuitable 
position, if they are located just before-a scene change. Scene-change detection is a known 
technique that can be e;q>k>ited to introduce an I-ftame at a portion where a good prediction 
of the ftame is not possible due to a scene change, but some sequences do not profit fi?om 
such techniques if ttie ftame ccmtent is ahnost completely different afto some ftames having 
high motion, with however no scene change at all (for instance, in a sequence whare a tennis 
player is continuously followed within a single scene). A previous European patent 
appUcation, akeady filed by the appUcant on October 14, 2003, with tiie filing number 
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03300155.3 (PHFR030124) has then described a new method for finding better reference 
fiames. 

SUMMARY OF THE INVENTION 

It is the object of the invention to propose a method based on said previous method 
for finding good reference fiiames, but allowing to reduce more noticeably the coding cost. 

To this end, the invention relates to a video encoding method such as defined in 
the introductory paragraph of the description and in which said CCS is used in said 
quantization sub-step for modifying the quantization factor, said CCS and the quantization 
factor increasing or decreasing simultaneously. 

The invention also relates to a device for implementing said method. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will now be described, by way of exan^le, with reference to 
the accompanying drawings in which : 

- Fig. 1 illustrates the rules used, according to the description given in the previous 
European patent application cited above, for defuiing the place of the reference fi^es of the 
video sequence to be coded ; 

- Fig.2 shows an encoder carrying out the method described in said previous European 
patent applicatio n in the MPEG encoding case ; 

- Fig-3 shows an encoder carrying out the method accordmg to the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The document mentioned above describes a method for finding which frames in the 
input sequence can serve as reference fi:ames» in order to reduce the coding cost The 
principle of this method is to measure the strength of content change on the basis of some 
simple rules, such as listed below and iUustrated in Fig.l : the measured strength of content 
change is quantized to levels (for instance 5, said number being however not a limitation), 
and I- flames are inserted at the beginning of a sequence of frames having content-change 
strength (CCS) of level 0, while P-fiames are inserted before a level increase of CCS occurs 
or after a level decrease of CCS occurs. The measure may be for instance a simple block 
classification that detects horizontal and vertical edges, or other types of measures based on 
luminance, motion vectors, etc. 
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An implementation of this previous method in the MPEG encoding case is described 
in Fig.2. The encoder comprises a coding branch 101 and a prediction branch 102. The 
signals to be coded, received by the branch 101, are transformed into coefficients in a DCT 
module 1 1 and quantized in a quaiiization module 12, the quantized coefficients being then 
coded in a coding module 13, together with motion vectors MV. The prediction branch, 
receiving as input signals the signals available at the output of the quantization module 12, 
comprises in series an inverse quantization module 21, an inverse DCT module 22, an adder 
'23, a ftaine memory 24, an MC circviit 25 and a subtracter 26. The MC circuit 25 also 
receives die motion vectors MV generated by a ME circuit 27 (many types of motion 
estimators may be used) from the input reordered frames (defined as explained below) and 
the output of die frame memory 24, and these motion vectors are also sent towards the coding 
module 13, die output of vMch ("MPEG output") is stored or transmitted in die form of a 
multiplexed bitstream. 

The iad«5 mpur6rffie"encoaer(successi^e-frames-X^^ 

preprocessing branch 103. First a GOP structure defining circuit 3 1 defines from die 

successive frames die structure of the GOPs. Frame memories 32a, 32b, are tiien 

provided for reordering die sequence of I, P, B frames available at tiie output of tiie circuit 31 
(die reference frames must be coded and transmitted before the non-reference frames 
depending on said reference frames). These reordered frames are sent on die positive input of 
the subtracter 26 (die negative input of which receives, as described above, die ou^ut 
predicted frames available at die output of die MC circuit 25. diese output predicted frames 
being also sent back to a second input of tiie adder 23). The output of die subtracter 26 
delivers frame differences diat are the signals processed by die coding branch 101 . For die 
definition of die GOP structure, a CCS computation circuit 33 is provided. 

It has dien been observed fliat die higher die CCS - which can result from motion - 
die less die viewer can really follow die presented video. It is consequendy proposed, 
according to die present invention, to increase or decrease die quantization fector used in the 
module 12 as a ftmction of tiie CCS - said CCS and die quantization factor increasing or 
decreasing simultaneously - which can be obtained by sending die output information of die 
CCS computation circuit towards the quantization module 12 of die coding branch. It is 
known, indeed, tiiat die coding module 13 comprises in feet, in series, a variable- lengdi 
coding (VLC) circuit 131 and a buffer memory 132, die output of said memory being sent 
back towards a rate control circuit 133 for modifying die quantization fector. 
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According to the invention, and as shown in Fig J in which similar circuits are 
designated by the same references as in Fig.2, an additional connection 200 intended to allow 
to implement the proposed modification of quantization factor is provided between the CCS 
computation circuit 33 and the rate control circuit 133 and also between said circuit 33 and 
5 the quantization module 12 of the coding branch. This connection 200 extends two coding 
modes of the coding system, namely a so-called open- loop coding mode (without bit-mte 
control) and a closed loop coding mode (with bit-iate control). 

In the open- loop coding mode for example, the quantizer settings are usually fixed. 
The resulting bit rate of the encoded stream is automatically lower for simple scenes (less 

10 residue needs to be coded) than for complex scenes (higher residue needs to be coded). 
Coding cases as described above, where the sequence contains high motion, result in 
complex scenes that are coded with high bit-rates. The bit-rates for the high- motion scenes 
can be reduced by higher quantization, thereby removing spatial details of these scenes that 
the observer cannot follow due to the motion. The quantization can be controlled by defining 

15 a quantization factor q^ccs which is a function of CCS and the original fixed quantizer factor, 

called ^_^e^ : 

q^ccs^qJixed'^f(CCS), 

where/(? is a function resulting in positive integers 0 (q^jnax-qjixed) to inraease q^ccs 

fiom qjixedyxpto an allowed maximum q_max. Exanqples fotfQ are Jl (CCS) — round ( CCS 

20 * (q^piax-qjixed) / (CCS^max) ) oxf2(CCS) = round ( (q^max -^Jbced-^l) ^ 
(CCS/CCS^max) -I) for CCS=0..CCSL7wax . 

In closed-^loop coding, the quantization factor q^adapt is adapted in order to achieve a 
desired predefined bit rate. Bit-rate controllers that are required for closed- bop coding work 
basically with bit budgets and chose q^adapt based on the available budget This means that 

25 the quantization factor q_css as described for open- loop coding can be used, and only qjixed 
has to be replaced with q_adapt. Then, compared to an umnodified rate controller, the bit 
budget will increase with higher CCS, and these additional bits are autonoatically spent on 
frames with lower CCS, because fee q_ddqpt value will decrease due to the iQcreased bit 
budget. 
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CLAIMS : 

1 . A video encoding method provided for encoding an input image sequence consisting 
of successive groups of frames in which each ftame is itself subdivided into blocks, said 

metiiod coniprising the steps of : 
5 - preprocessing said sequence on the basis of a so-called content-changp strength 

(CCS) computed for each fiame by applying some predetennined rules ; 

- estimating a motion vector for each block of the frames ; 

- generating a predicted frame using said motion vectors respectively associated to the 

blocks of the current frame ; 
10 - applying to a difference signal between the current frame and the last predicted 

frame a transformation sub-step producing a pluraUty of coefficients and followed by a 
quantization sub-step of said coefficients ; 

- coding said quantized coefficients ; 

wherein said CCS is iised in said quantization sub-step for modifying the quantization factor, 
15 said CCS and the quantization fector increasing or decreasing simultaneously. 

2. A video encoding device provided for encoding an input image sequence consisting 
of successive groups of frames in which each fiame is itself subdivided into blocks, said ; 
device comprising the following means : 

- preprocessing means, provided for preprocessing said sequence on the basis of a so- 
20 caUed content-change strength (CCS) computed for each fiame by applying some 

predetermined rules ; 

- estimating means, provided for estimating a motion vector for each block of the 

fiames ; 

- generating means, provided for generating a predicted fi^e on the basis of said 
25 motion vectors respectively associated to the blocks of the current frame ; 

- transforming and quantizing means, provided for applying to a difference signal 
between die current frame and the last predicted fiame a ttansfonnadon producing a pluraUty 
of coefficients and followed by a quantization of said coefficients ; 

- coding means, provided for encoding said quantized coefficients ; 

30 vi^erein an output of said preprocessing means is received on an input of said transformation 
and quantization means for modifying the quantization factor on the basis of said CCS, said 
CCS and tiie quantization fector increasing or decreasing simultaneously. 
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Abstract 

The invention relates to a video encoding method provided for encoding an input 
image sequence consisting of successive groups of firames in which each fiame is itself 
subdivided into blocks, and to the corresponding video encoding device. This method and 
device perform the steps of preprocessing the sequence on the basis of a so-called content- 
change strength (CCS) computed for each frame, generating a predicted frame using said 
motion vectors estimated for each block, applying to a difference signal between the current 
frame and the last predicted fiame a transfoimation sub-step producing a plurality of 
coefficients and foUowed by a quantization sub-step of said coefficients, and coding said 
quantized coefficients. According to the invention, said CCS is used in said quantization sub- 
step for modifying the quantization fector, said CCS and the quantization factor increasing or 
decreasing simultaneously. 
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